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Abstract 

This paper introduces a class of games, called unit-sphere games, where strategies are real 
vectors with unit 2-norms (or, on a unit-sphere). As a result, they can no longer be interpreted 
as probability distributions over actions, but rather be thought of as allocations of one unit of 
resource to actions and the multiplicative payoff effect on each action is proportional to square- 
root of the amount of resource allocated to that action. The new definition generates a number 
of interesting consequences. We first characterize sufficient and necessary conditions under 
which a two-player unit-sphere game has a Nash equilibrium. The characterization effectively 
reduces solving a unit-sphere game to finding all eigenvalues and eigenvectors of the product 
of individual payoff matrices. For any unit-sphere game with non-negative payoff matrices, 
there always exists a unique Nash equilibrium; furthermore, the unique equilibrium is efficiently 
reachable via Cournot adjustment. In addition, we show that any equilibrium in positive unit- 
sphere games corresponds to approximate equilibria in the corresponding normal-form games. 
Analogous but weaker results are extended to positive n-player unit-sphere games. 


1 Introduction 

Consider the following two games. 

Example 1. Protecting Manhattan. Two police stations try to protect Manhattan (could be visual¬ 
ized as a rectangle) from a two-dimensional terrorist attack. Station A is responsible for protecting 
all the streets, i.e., the horizontal paths across the rectangle; while station B is responsible for 
protecting all the avenues, i.e., the vertical paths. Each police station has one unit of police force, 
distributes optimally its force among its paths, and derives a positive utility ufj (resp. uf^) from 
successfully protecting each subway station Sij, namely, the intersections of street i and avenue j. 
The probability of successfully protecting a subway station is ^Jaibj, where a* and bj are the amount 
of police force station A and B allocates to street i and avenue j respectively. 

Example 2. Graphical ads for combinatorial queries. Consider a situation where a user submits a 
query (say, “Yellow Stone national park”) to a travel website. The query triggers interests from two 
complementary advertisement agencies: one has a collection of hotel ads and the other airline ads. 
The website allocates two regions on its homepage, each with one unit of area, to the two groups of 
ads respectively. Agency A, with a set of hotel ads, tries to fill in the first region with graphical ads 
of hotels, similar for agency B and airline ads for the second region. Each agency derives positive 
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utility (resp. uf^) if the user successfully purchases a combination of (hotel i, airline j). Note 
that here agency A’s utility may depend on j since payment rule may involve both i and j. The 
probability that the user purchases the combination is y/aibj, where Oj and bj are the areas A and 
B allocate to hotel ad i and airline ad j respectively (so y/df is roughly the height or width of ad i). 

At first glance, both games resemble a general version of Blotto game [8], thus nontrivial to 
solve. However, a close scrutiny reveals an interesting pattern: both games be modeled as normal- 
form games where players need to allocate one unit of resources to actions and the payoff effect 
on each action is proportional to the square-root of amount of resource allocated to that action. 
As we shall see, both examples are instances of positive unit-sphere games, which possess unique, 
learnable pure Nash equilibria. 

2 Unit-sphere games 

Most of the paper deals with 2-player unit-sphere games. In Section [H this definition is extended 
to accommodate any number of players. 

Definition 1. A two-player unit-sphere game (USG) is defined by two matrices Ax B, where 

• A is an m X n payoff matrix for player 1, 

• B is an n X m payoff matrix for player 2. 

A unit-sphere strategy x for player 1 is a column vector of real numbers such that x G 
ll^^lb = 1) while a strategy y for player 2 is a column vector of real numbers such that 

X G M"’, ||y ||2 = 1. Given a strategy profile {x,y), the utility obtained by player 1 is x'^Ay while 

the utility of player 2 is y^Bx. Other game-theoretical notions, such as best response and Nash 
equilibrium, follow standard definitions. 

Mathematically, the above definition is a 2-player normal-form game except for the definition 
of strategy, where the restriction of unit Li-norm is now replaced by unit L 2 -norm. In other words, 
each unit-sphere strategy is a point on a unit sphere, rather than a probability distribution. This 
also implies that both x and y can be negative on some coordinates, as long as they are on a unit 
sphere. 

It is important to note that a USG can just be thought of as a standard normal-form game where 
each pure strategy corresponds to a unit-sphere strategy and there are infinite many such strategies. 
From this perspective, the characterization theorems (Theorems 13.11 13.31 13.4p are sufficient and 
necessary conditions for a large class of games to have (unique) pure Nash equilibria. 

In this paper, we do not consider randomized unit-sphere strategies, for the following reasons. 
First of all, a randomization over unit-sphere strategies is no longer a unit-sphere strategy, thus 
not well-defined under our new definition. Secondly, it is not hard to see that such a randomized 
strategy has a L 2 -norm less than 1 and is always utility-dominated by some unit-sphere strategy. 
Last but not least, we are interested in comparing unit-sphere strategy (which is somewhat mixed) 
to standard mixed strategy, in terms of existence and computation efficiency of Nash equilibrium. 
Adding another level of mixture makes the comparison less interesting. 

One can also view players in a USG as risk averse agents whose payoffs, when facing a lottery 
outcome, are not linear expectations of their utilities on deterministic outcomes in the lottery, but 
concave expectations (in our case, a square-root function). In general, games with concave utility 
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agents possess a mixed Nash equilibrium and it is in general PPAD-hard to compute such an 
equilibrium [U Theorem 1]. Our model and results does not follow from Fiat and Papadimitriou in 
that, firstly, we allow for negative strategies, i.e., x and y can have negative entries, thus the whole 
strategy set is not necessarily convex, precluding a Nash style proof; secondly, when restricting to 
non-negative strategies, under the additional assumption of positive payoff matrices, we are able to 
show that a unique Nash equilibrium exists and easy to compute. Readers are referred to [4j and 
the references therein for an introduction on non-linear expectations. 

Finally, in our definition, adding a positive constant to each payoff matrix cell no longer yields 
an equivalent USG. Intuitively, when adding a large constant to a player’s payoff function, the 
player has more incentive to distribute her resource evenly among pure strategies. So, it loses 
generality to restrictions on positive payoff matrices. On the other hand, USGs are scale-invariant 
in the sense that multiplying a constant to a player’s payoff function yields an equivalent USG. 


3 Nash Equilibria in USGs 

In this section, we characterize sufficient and necessary conditions for Nash equilibrium (NE) to 
exist in USGs. In particular, equilibrium exists in all the USGs with positive payoff matrices. It is 
unique and efficiently computable, via a well-known learning process known as Cournot adjustment. 


3.1 Structure of NE in USGs 


Let us now consider NE in a USG Ax B. It is easy to see that the utilities of the two players are 

ui = x'^Ay = \\Ay \\2 cos a, 

U 2 = y"^Bx = ||il3:||2 cos /3, 

respectively, where a denotes the angle between x and Ay and /3 denotes the angle between y and 
Since both x and y are on the unit-sphere, a strategy profile (x, y) forms an NE if and only if 

X = arg max A?/ a = 0 <;=^ \x = Ay, 

x' 


and 


y = argmaxy'^Rx 
y' 


/3 = 0 


/ry = Bx, 


where A = ||Ay|| 2 , y = ||Rx|| 2 . 

By this observation, we have a necessary condition of existence of NE for two-player USGs. 


Lemma 1. Let A and B be the matrices of a USG. If AB and BA do not share a nonnegative 
eigenvalue, the USG does not have an NE. 


Proof. We show that an NE exists only if AB and BA share a nonnegative eigenvalue. Gonsider 
payoff matrices A and B. Eor a NE profile (x,y). 


BXx = BAy ^ Xfiy = BAy, 

AXy = ABx Ayx = ABx. 

In other words, x is an eigenvector of AB with eigenvalue Ay, and y is an eigenvector of BA with 
eigenvalue Ay. □ 

^When Ay = 0 (resp. Bx = 0), one may set a (resp. (3) arbitrarily. 
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Since AB and BA have the same set of eigenvalues, the following theorem characterizes the 
sufficient and necessary condition for an NE to exist in any two-player USG. 

Theorem 3.1. Let A and B be the matrices of an USG. There exists an NE for the USG if and 
only if AB (or BA) has a nonnegative eigenvalue A > 0. 

Proof. The only-if direction follows from Lemma [TJ We now prove the if direction. Assume AB 
has a nonnegative eigenvalue A with eigenvector x such that ||x ||2 = 1- 

• If Bx 7 ^ 0, let y = {x,y) is an NE for the game, because ~ = 

Bx. 

• If Bx = 0, y 7 ^ 0 can be chosen such that either Ay = kx for some k > 0, when det A 7 ^ 0, 
or Ay = 0, when det A = 0. Also we assume ||y ||2 = 1. Again {x,y) is an NE for the game, 
because kx = Ay for some k > 0, and the utility of player 2, y^Bx, is always 0. 


□ 

As stated in Theorem EH to solve an USG A x B, i.e., to find all NEs or to ensure that no 
NE exists, it is equivalent to calculate all eigenvalues of AB and the corresponding eigenvectors. 
Solving USGs is therefore effectively reduced to the eigenvalue problem, for which one may refer to 
the standard Singular value decomposition. We refer readers to [9] for more efficient algorithms. 

3.2 Positive USGs 

We now focus on a general class of USGs where there always exists a unique NE. 

Definition 2. A USG Ax B is positive if A,B > 0, and any strategy satisfies x,y > cH. 

Positive USGs (PUSGs) have many interesting properties that general USGs do not necessarily 
possess. Before we state these properties, we need the following lemma from linear algebra. 

Lemma 2. (Perron-Frobenius flf): For any sguare matrix A > 0, we have 

• A has an eigenvalue A > 0. Moreover, for any other eigenvalue y of A, |A| > |y|. We call A 
the Perron-Frobenius value, or spectral radius of A, denoted as X = p{A). 

• The eigenvalue A has algebraic and geometric multiplicity one. 

• There is an eigenvector x > 0 of A with an eigenvalue of X. 

• The only positive eigenvectors of A have the form kx for some k > 0. Moreover, all positive 
eigenvectors have corresponding eigenvalue X. 

Lemma 3. For payoff matrices A > 0, B > 0, AB and BA share at least one positive eigenvalue, 
which is their spectral radius. 

^We say a matrix j 4 > 0 if Aij > 0 for all and a vector x > 0 if Xi > 0 for all i. 


4 





Proof. Clearly, AB and BA are square matrices. Let x > 0 be an eigenvector of AB with eigenvalue 
A = p{AB) > 0, whose existence is guaranteed by Lemma El Note that 

BA{Bx) = B{ABx) = \{Bx). 

Namely, Bx is an eigenvector of BA with eigenvalue A. It follows that AB and BA share the same 
positive eigenvalue A > 0. Now suppose p{BA) > A. By the same argument, we can see that p{BA) 
is an eigenvalue of AB, a contradiction. □ 

With Lemma [3l we are now able to derive two NEs for all PUSGs. 

Theorem 3.2. There exists two NE {xi,yi), {x 2 ,y 2 ) for any PUSG, where 

• xi > 0 is the unit eigenvector of AB with eigenvalue A = p{AB). 

where the utilities of the players obtained from (xi,yi) are • 

• ?/2 > 0 is the unit eigenvector of BA with eigenvalue A = /9(i3T)ll 

Ay2 

• ^2 = p^- 

where the utilities of the players obtained from {x 2 ,y 2 ) are ^||^?/ 2 || 2 ; 11 ^^ 2112 )' 

Proof. We prove for the case of (xi,yi). The case of {x 2 ,y 2 ) is symmetric. By Lemma [3l it is 
always feasible to pick xi as stated in the theorem. For player 1, 

ui{x',yi) = 


For player 2, 


x^'^Ayi = 


x'^ABxi 


\Bx 


12 


A 


I.BX1II2 

A 




A 


\\Bx 


1||2 


T 

-Xi Xi 


\Bx 


1||2 


U2{xi,y') = y'^Bxi = y''^\\Bxi\\2yi 
< \\Bxi\\ 2 yiyi = IIBX1II2. 

In other words, neither player has profitable deviation in (xi,yi). □ 

Theorem 13.21 derives a pair of symmetric NEs for any PUSG. One might wonder whether the 
two NEs are identical? This is indeed the case. We dedicate Subsection 13.31 to this result. 

In fact, there is a symmetric NE in a PUSG if the payoff matrices satisfy certain additional 
conditions. Before we state these conditions, we need the following technical lemma. 

Lemma 4. For sguare matrices A > t), B > such that AB = BA, A and B share the same 
one-dimensional eigenspace of spectral radius. 

^Recall that p{AB) = p{BA) 
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Proof. Let A = p{A), x > 0 be an eigenvector of A whose corresponding eigenvalue is A, then 

A{Bx) = B{Ax) = \{Bx), 

namely Bx is an eigenvector of A whose eigenvalue is A. By Lemma [2l the eigenspace of A is 
one-dimensional, which implies that Bx = px for some p. Again by Lemma O x belongs to the 
eigenspace of the spectral radius of B, or equivalently /i = p{B). □ 


If AB = BA, the corresponding PUSG has a symmetric NE. 

Theorem 3.3. There is a symmetric NE (x,x) for any PUSG with square payoff matrices Ax B 
such that AB = BA. The NE utilities are {p{A),p(B)). 

Proof. Let x > 0 be the unit eigenvector of A whose corresponding eigenvalue is p(A) (and therefore 
the unit eigenvector of B whose eigenvalue is p{B)). For player 1, 

ui(x^,x) = x''^Ax = p{A)x''^ X 

< p{A)x'^x = p{A). 


For player 2, 


U 2 {x' ,x) = x'^ Bx = p{B)x'^x 
< p{B)x'^x = p{B). 

Neither player has a profitable deviation in (x,x). 


□ 


3.3 Uniqueness of NE in PUSGs 

One of the most appealing properties of all PUSGs is that they have unique NE. 

Theorem 3.4. Any PUSG has an unique NE. 

Proof. Let (x,y) be an arbitrary NE of PUSG with payoff matrices A and B, whose existence has 
been established in Theorem 13.21 By Lemma [H 

3A > 0, p > 0, s.t. ABx = Ax, BAy = py 

We will show that A is the spectral radius of AB, and x is the corresponding positive unit eigen¬ 
vector. The case of y is symmetric. Assume A 7 ^ p{AB). By Lemma [2l there must be some i E [n] 
such that Xj = 0, since there are no other positive eigenvectors beside those of the spectral radius. 
Note that A > 0, AB > 0. 

0 = Xxi = {ABx)i = ^^{AB)ijXj 
j 

> min(AB)jj||x||i > 0, 

a contradiction. Therefore A = p{AB). Again by Lemma [2l the eigenspace of A is one-dimensional. 
Namely x is the unique positive eigenvector of A such that ||x ||2 = 1. The same argument works 
for y. To conclude, we prove that (x, y) is the unique NE. □ 

Corollary 1. Any PUSG has an unique NE, which has the form stated in Theorem \3.2[ Moreover, 
the two symmetric NEs in Theorem \3.^ are identieal. 

Next, we show the unique NE of a PUSG can be efficiently found via a natural learning process. 
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4 Solving PUSGs via Cournot adjustments 

In this section, we show that the unique NE of any PUSG can be resulted when both players follow 
a well-known learning process called Cournot adjustments . This is remarkable property since it 
states that players can learn to play NE even without any information of each other’s payoff matrix. 

4.1 Cournot adjustments 

Define Cournot adjustments as follows, 

1. In the first round, each player i plays any positive strategy s^ > 0. 

2. In round t, each player i observes s5_j, the strategy of player —i. 

3. In round t + 1, each player i plays her best response against Namely 

t+l ^ 

\\A^sU\\2^ 

4. Iterate until no player updates her strategy. 

Cournot adjustments define a natural protocol for players to learn to play a game over time. 
It is appealing when players do not know others’ payoff matrices and for whatever reason that 
the players cannot perform equilibrium computation upfront. It is known that, for any standard 
games, a carefully designed better response dynamics can converge to some mixed-strategy Nash 
equilibrium (aka. Nash’s proof), but may take exponential number of rounds. In the following, 
we show that this procedure thoroughly exploits the properties of PUSGs and finds efficiently the 
unique NE for any PUSG in logarithmic number of rounds with respect to initial error. 

4.2 Convergence of Cournot adjustments in PUSGs 

To formally state and prove the convergence result, we need the following proposition from numer¬ 
ical analysis. 

Lemma 5. (Convergence of power iteration m)-- For any positive square matrix A whose eigenvalue 
with the largest modulus is A and the corresponding eigenspace is E, let xq be an arbitrary unit vector 
such that X is not orthogonal to E. Let 

, Ax^-^ 

^ “ \\Ax^-%' 

It is guaranteed that x* converges to x*, where Ax* = \x*. Moreover, 

Vp G Z"*" U {oo}, 3r G (0,1), c G M"*", s.t. 

\\x^ — x*\\p < cr^. 

In presence of LemmaEl we now state a convergence result of best response dynamics in PUSGs. 
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Theorem 4.1. If both players follow Cournot adjustment, the strategy sequence (x*,?/*) converges 
to the unique NE of the PUSG with an exponentially decreasing error. Or in other words, there is 
a linear convergenc^ for to the NE, following Cournot adjustment. 

Proof. Let A and B be the payoff matrices. We can explicitly derive the strategy expressions of 
Cournot adjustments in round t as follows, 

X* = Ay^~^, y* = Bx^~^. 

It follows that 

x^^ = {ABfx^, = {BAfy'^, VA: E N. 

Since we choose x° > 0, y® > 0, by Lemma O it is impossible that x° (resp. y^) is orthogonal to 
the eigenspace of the spectral radius of AB (resp. BA). By Lemma [5l as k grows, x^^ converges to 
the positive unit eigenvector of AB exponentially fast, and converges to that of BA. Therefore 
(x 2 fc,y 2 A:) converges to the unique PSNE exponentially fast. As (x^^,y^^) converges, (x^^"*“^, y^^"*“^) 
converges as well, concluding the proof. □ 


5 Approximating mixed-strategy equilibrium in standard games 
via USGs 

It is well-known that computing a mixed-strategy Nash equilibrium (MSNE) in standard two-player 
games is PPAD-complete [3]. In this section, we show that our understanding of USG can help us 
to compute an approximate MSNE for any standard games. 


5.1 Approximation Scheme 

Consider any PUSG. By theorems we have derived so far, one can easily compute the unique NE 
(x, y) of the PUSG. We now normalize x and y to be x' and y', so that ||x'||i = ||y'||i = 1. Our main 
finding is that (x', y') is a multiplicative O ^Y^max(m, n)^-approximate MSNeH for the underlying 
standard two-player game. 

Gall this approximation scheme the simple approximate scheme. 


5.2 Approximation via simple approximate scheme 

Once again, before we state and prove our result, we need the following technical lemma. 


Lemma 6. 


min 

xSK**, ||a;||i=l 




■^Linear convergence is another way of saying the error diminishes exponentially fast in the number of iterations. 
multiplicative k-approximate MSNE denotes a strategy profile where no player can improve her utility by k 
times via deviation. 






Proof. Let t = ||x||oo > Obviously, 

||x||i ^t‘^ + in-1) (^) 

Halloo t 

n 2 1 

—-^ r 

n — 1 n — 1 t{n — 1) 

^ 2 

~ y/n + 1' 

□ 


We are now ready to state our main result of the section. 

Theorem 5.1. For any standard two-player game with payoff matrices A and B, the simple ap¬ 
proximation scheme yields a multiplicative O max(m, n)^ -approximate MSNE, where m is the 
number of rows of A, and n is the number of rows of B. 

Proof. Let {x,y) be the NE of the induced PUSG over payoff matrices Ax B, and {x',y') be the 
normalized vectors, as stated in the simple scheme. Since {x,y) is an NE in the PUSG, 3A, /r, s.t. 

Ay' = Ax', Bx' = pLy'. 


Consider player one’s payoff with or without deviation. 
Without deviation, she gets 

ui{x' ,y') = x''^Ay' = A||x '||2 


By deviation, she gets 


By Lemma [6l 


max ui{xi,y') = max xjAy' 
l|a;i||i=i ll^illi=i 

= A max xfx' = A||x'||oo- 
ll^i||i=i 


min - 

{x',y') maxi 


ui{x',y') 


= mm 


> 


(x',y') ||X 


1 m — 2y/rn + 1 
^/m y/rn{m — 1) 



Symmetrically, for player two, the approximate factor becomes O 



□ 


6 Multiplayer PUSGs 

Definition 3. An m-player PUSG is defined as {A ^^..., AF'), where A^ = (^A^^ is the game 

tensor for player k, such that A^_^^ > 0 for all ii £ [ni ],... Am ^ [nm\- 
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6.1 Existence of NE in multiplayer PUSGs 

Lemma 7. (Brouwer’s fixed point theorem): For any n E Z"*", 12 C M"- which is compact and 
convex, / : 12 —>■ 12 which is continuous, there is some x* E 12 such that f{x*) = x*. 

Theorem 6.1. There exists an NE for any m-player PUSG , A’^). 

The proof resembles that of the existence of MSNE in normal form games. 


Proof. Let 

D 12 = |x E M®"" 
For all X = (xi,..., Xm) E 12, 

^^X2X3 ... Xi 


Si — Uj , 
j<i 


fix) = 


Xij > 0, Vi € [m],j E [Ui], ||xi||i = IVi E [m]| . 
^ A'^XiXs . . .Xm A™XiX 2 . . .Xm-l 


plx2X3 . . . X^lli ’ WA'^XiXs . . . Xmlii ’ ■ ■ ■ ’ P™XiX2 • • • Xm-l||l, 

It is easy to verify that 12 and / satisfy the conditions in Lemma [71 Therefore there is some x* = 
(x^,..., Xm) satisfying f{x*) = x*, which implies that there is some Aj such that A*x^ ... Xm = AjX* 


for all i. So 


Filb' 


is an MSNE of the PUSG. 


□ 


Note that, this theorem also follows from [U Theorem 1]. 


6.2 Subclasses of multiplayer PUSGs 

In this subsection, we investigate several subclasses of multiplayer PUSGs where NE is easy to find. 


6.2.1 Symmetric PUSGs with even number of players 


First we give an algorithm that solves m-player symmetric PUSGs when m is even. 

Definition 4. An m-player symmetric PUSG is a PUSG where A’' = A^ for all i,j E [m], and 
~ o-(im) f^x all k E [m] and a E Sn, where Sn is the permutation group and n is 

the number of actions of each player. 


The method used to find NE is symmetric PUSG is called SS-HOPM. SS-HOPM outputs a 
symmetric NE with a particular payoff (which equals the largest Z-eigenvalue of the payoff tensor). 
The linear convergence of SS-HOPM has been shown originally in [5] and revised in [2]. 

The algorithm performs as the following. 


1 . 

2 . 

3. 


Ghoose x° > 0, and the shift constant a = 
Let y^~^^ = -|- ax*. 

Gompute 

x“ 


„i+l 




As shown in |2], X* converges to an symmetric PSNE x* while A* converges to the payoff of each 
player playing x*. 
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6.2.2 Markov PUSGs via Cournot Adjustments 

Generalizing [6], we show that a unique PSNE exists in any Markov PUSG, which can be efficiently 
achieved via Cournot adjustments. 

Definition 5. An Markov PUSG (A^,..., A"*) is a PUSG such that 

'^k 

for all k e [m],ii G [m ],..., ife-i £ [nk-i], h+i G [^fc+i], • • • Am G [nm] and a constant Ck■ 

In words, Markov PUSG is a subset of PUSG such that, fixing any other players’ strategy 
profile, the sum of player fe’s utility over all his/her actions is a constant, for any k. Since every 
Markov PUSG can be scaled such that for all k, Ck = 1, it is without loss of generality to consider 
only those games. 

Lemma 8. For nonnegative xi,..., Xm such that ||xj||i = 1, we have ||A^xi... Xk-iXk+i ... x„||i = 
1 for all k G [m]. 

Proof. 


\\A^xi. ..Xk-iXk+i ■. .x^lli 


A^ 


E E ■ ■ ■ a:k-l,ik-l^k+l,i^:+l ■ ■ ■ Xm,i 

V Afc —1 Afc+1 5*** A' 

• • • ^k-l,ik-iXk+l,iu+i ■ ■ ■ 


. X. 


m,tm 


^1 ? • •—1 Afc+1 »• • • 

• • • Xm,in 


^1 )•** Afc —1 
= 1 


□ 


Lemma 9. Let O be as defined above, f : kl ^ kl be such that for v ^ LI, 

f{v)k = A^m . . . Vk-lVk+l ...Vra- 
For X = (xi,... ,Xm) ^LL, y = (yi, ...,ym)^LL, 


ll/(a;)fc -/(y)fclli < (1 - 4 ) ( ^ \\xi-yi\\i 


k 2E[m], i^k 


where 


6k = min 

ycK] 


mm 


ikGV 


+ 


n, 


mm 


ik&V 


and V' = [uk] \ V. 
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Proof. 


if{x)k,ik - fiy)k,ij 

'k^^k 


'^k^'^k 

^^ ^^ ‘n,...,im (^l,n ■ ■ ■) l,*fe_i) ®fe+l,ife+i) ■ ■ • ) 

ik&Vk + 

yi,il • • • ) yk—l,ik-i ) yk+l,ik+i ) • • • ) ym,im ) 

E E - yi,h)x2,y2 ■ ■ ■ 


Xm,im 


1 Xm,im~^ 


yi,ilix2,i2 2/2,12)^3,13 ■ ■ ■ Xm,im + ' ' ' + 2 / 1 ,ii ■ ■ ■ 2 /m—l,im-l(' 


X. 


' ym,im)]- 


Vk C [rik] be the largest set such that Vi^ G 14 , f{x)k,i^ > f{y)k,ik^ ^ [’^i] the largest 
such that Vii G 14, > 2/i,ji- Note that by Lemma [HI ||xfc||i = ||2/fc||i = 1, and hence 

Xk,ik ~ yk,ik = 0 for all k G [m]. We then have 


_ _ 2 / 1 , 11 ) 2 / 2,12 •■■) 

+ ik&Vk 


ym,iri 


E E '4.... 

— ^ii,...,im(^l,*i ~ 2/l,ii)2/2,i2 • ■ ■ ) 2 /m,im + 

nSVi i2,...,ifc_i,ii;+i,...,*m ik&Vk 

E , . E E '4.. 

il^Vl + ifcSVfc 

^ Z] Z ■ ■ Z 

^ I Jl6Vl,j2,.--Jm 

*l6ri I2,...,*fe-l,*fe + l,...,*m \ 3k&Vk 


(xi,ii yi,ii)y2,i2 • ■ ■ ) ym,im 

■■■I'im 'ifcSV/c 

(311,11 ~ 2/1,11)2/2,12 • • • ) ym,im~ 


Z Z (1 ,• Z 

ll^Vl 12,---,lfe-l,lfe + l,...,*m \ Jk&Vk / 


( 2 / 1,11 31i^ji )?/2,i2 • • • 7 ym,i„ 


max > ^ 7 i,... 7 m “ 


mfo Z ^317- 


mill 

'ifceVfe 

E E ( 2 / 1,11 - 3 : 1 , 11 ) 2 / 2 , 

n^Vi i2,---,ife-i,ife+i,...,'' 


,i™^| X 

,12 ‘ ‘ ‘ 2/fc —+ • • • Vm^ir) 


< 1 


- min 

il vijr? 


Z ^ii7-7i- “ “in Y ^h,-,3m Y “ 

jk^Vk jk&Vk / il^Vl 


Xi 


,^1 2 


< ^(1 - 4)||a:i - 2/i||i- 


12 


We therefore get 


\\f{x)k - f{y)k\\i = ‘^ - fiy)k,0 


i^k 


-(1 - 6k)\\xi - yi\\i 


(1 - 4 ) ( ^ \\xi - yi\\i 

i^k 


□ 


Theorem 6.2. There exists an unique NE in any Markov PUSG where 6k > for dll k. 


Proof. Assume there are two distinct NE in an m-player game (A^,..., A™), xq and yo- Let 

"" = Wp y = ]&■ Lemmal 


\\x - y\\i 

=Y 11 ®*^ “ 

k 

k 

k i^k 


ie[m] 

=11^; - y||i, 


m — 1 


an contradiction. 


□ 


Theorem 6.3. Cournot adjustments lead to the unique NE in any Markov PUSG where 6k > 
for all k. 

Proof. For simplicity, we denote strategies by vectors whose Li-norm are scaled to 1 in the proof. 
Consider a procedure where player k starts by playing x^ = ..., Let x* be the unique 

PSNE of the game, guaranteed to exist by Theorem 16.21 Let cq = maxjg[^] ||x? — x*\\i, 6 = 
maxjg[m](l — 6i). Clearly, in round t, strategies of player k will be xj, = f(x*~^)k, where / is the 
same as stated above. 

On the other hand, as shown in Lemma [U 


xi 




< 5 



, Vt G Z~^, k G [mj. 


By a simple induction, we prove that 


et = max [\\xl - x^Hi] < (m - 1)^5 *cq. 

k 


13 









When t = 0, it holds obviously that eo < cq- Assume that ct-i < {m — 1)^ ^6^ ^eo, we may show, 


Cf = max 
k 
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< max (5 
k 



t-1 


- Xi 


Ii < 6(m - < (m- 1)^(5*eo- 


It can be seen easily that e* goes to 0 exponentially fast considering that S < 

We have shown that the Li-norm normalized strategies converge to the Li-norm normalized 
NE. It follows naturally that the strategies themselves converge to the unique NE. Morevoer, the 
convergence is linear, i.e., the error decreases exponentially fast. □ 


6.3 Multiplicity of NE in multiplayer USGs 

In fact, there may be infinitely many NEs in a multiplayer USG. Here is an interesting example [2]. 


Example 3. Consider a 4-p^o,y^x USG where game tensors 


^2122 


- 42 

— ^1112 


- 42 

— ^1222 


- 43 

— ^1112 


- 43 

— ^2122 


— 4"^ 

— ^1121 


— ^2122 — 2 , Al ^ 


*1*2*3*4 

symmetric strategy x = (xi,X 2 )- In order for x to be an NE, we need 


= 0 otherwise. 


are such that 


We consider 


2xfx2 = Axi 
2x1X2 = Ax 2 
xf + x| = 1 


By setting A = 2xiX2, it appears that any pair of (xi,X 2 ) where x2 + x^ = 1 forms a symmetric 
NE. Moreover, any equilibrium payoff X G [0,1] can be achieved by some choice o/(xi,X 2 ). 
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